Prediction Improvement using Optimal Scaling on Random Forest Models for Highly Categorical Data

نویسندگان
چکیده

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predicting disease risks from highly imbalanced data using random forest

BACKGROUND We present a method utilizing Healthcare Cost and Utilization Project (HCUP) dataset for predicting disease risk of individuals based on their medical diagnosis history. The presented methodology may be incorporated in a variety of applications such as risk management, tailored health communication and decision support systems in healthcare. METHODS We employed the National Inpatie...

متن کامل

Classification and Biomarker Genes Selection for Cancer Gene Expression Data Using Random Forest

Background & objective: Microarray and next generation sequencing (NGS) data are the important sources to find helpful molecular patterns. Also, the great number of gene expression data increases the challenge of how to identify the biomarkers associated with cancer. The random forest (RF) is used to effectively analyze the problems of large-p and smal...

متن کامل

Three-stage inversion improvement for forest height estimation using dual-PolInSAR data

This paper addresses an algorithm for forest height estimation using single frequency single baseline dual polarization radar interferometry data. The proposed method is based on a physical two layer volume over ground model and is represented using polarimetric synthetic aperture radar interferometry (PolInSAR) technique. The presented algorithm provides the opportunity to take advantages of t...

متن کامل

Process Improvement of Experimental Measurements Using D-optimal Models

In this paper, the application of D-optimal models, as an alternative to response surface models (RS models) for design of experiment (DOE) was examined. Two D-optimal models for tilt-rotors in the wind tunnel experiment, as a form of quadratic functions, were generated based on a chosen optimality criterion. This optimality criterion was used to generate the optimized sampled points in the des...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of Computer Applications

سال: 2014

ISSN: 0975-8887

DOI: 10.5120/18895-0183